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Abstract. We propose a new method of analyzing the asymptotics of moments 
of certain hnear random recurrences which is based on the technique of iterative 
functions. By using the method, we show that the moments of the number of collisions 
and the absorption time in the Poisson-Dirichlet coalescent behave like the powers 
of the "log star" function which grows slower than any iteration of the logarithm, 
and thereby prove a weak law of large numbers. Finally, we discuss merits and 
limitations of the method and give several examples related to beta coalescents, 
recursive algorithms and random trees. 



1. Introduction and main result. 

. A linear random recurrence is a sequence of random variables {Xn,n £ N} which 
satisfies the distributional equality 

K 

(1) ^1=0, = K + n>2, 

where X„ is some parameter of a problem of size n, which splits into K > 1 sub- 
problems of random sizes /^"^ G {!,...,«}. For every r = 1,...,K, the sequence 

{xj^\k G N} which corresponds to the contribution of subgroup r is a distributional 
copy of {Xk, k £ N}, Vn is a. random toll term, and Ar{n) > is a random weight of 
subgroup r. It is assumed that {(-^"i), • ■ • , -^(k); ■ • ■ : AK{n),Vn), n > 2}, {Xn\n e 

N}, . . . , {Xii^\n G N} are independent. 

Random recurrences ([T]), often in a simplified form with K ~ 1, arise in diverse 
areas of applied probability such as random regenerative structures [HI E] , random trees 
[5J [71 [151 , coalescent theory [SI [TUl HH [H] , absorption times in non- increasing Markov 
chains [T31 [2] , recursive algorithms [121 [211 [23 [2H] ) random walks with barrier [T71 [TB] , 
to name but a few. 

The first step of asymptotic analysis of recurrences ([Ij is to find the asymptotics of 
moments EX^ and central moments E(X„ — EX„)'°, as n — > c>o. This problem reduces 
to studying the recurrent equations of the form 

ri-l 

(2) oi = 0, a„ fe„ + ^ CnfcOfe, n>2, 

k=l 

where n G N} and {cnk, n £ N,k < n} are given numeric sequences. The purpose of 
the present paper is to propose a new method of obtaining the first-order asymptotics of 
solutions to ([2]), as n — > oo. 

Although the asymptotic analysis of recurrences ([2|) is a hard analytic problem, some 
more or less effective methods have been elaborated to date. Evidently the most popular 
existing approach is the method of singular analysis of generating functions [H [8] . The 
method gives a very precise information on the asymptotic behavior whenever there is a 
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tractable functional relation between the generating functions of the sequences involved. 
The idea of a repertoire method proposed in |15| can be briefly described as follows. First 
we build up a repertoire {6^"'', a G A}, where A is a finite set, of inhomogeneous terms 
of (l2|) by choosing sequences {a^\ n G N} such that the sum in ([2|) is tractable. Then 
we construct the solution a„ to ([2]) with inhomogeneous term &„ as a linear combination 
of solutions ai^\ a G A. Finally, we mention a method proposed in [Ij and further 
developed in [28] which is based on the harmonic analysis and potential theory. 

Initially, our method which is based on the technique of iterative functions was devel- 
oped in order to find the asymptotics of moments of the number of collisions X„ and the 
absorption time Tn in the Poisson-Dirichlet coalescent (this problem was raised by Mar- 
tin Mohle in the early 2008). Below we prove that both EX^ and ET,^, fc G N, behave like 
the powers of the " log star" function which grows slower than any iteration of the loga- 
rithrrQ. This somewhat exotic behavior of the moments partially explains the fact that 
we have not been able to apply either of previously known (to us) methods to tackle the 
problem. To our knowledge, the "log star" asymptotics arises not often. In particular, we 
are only aware of two applied models which exhibit such a behavior: (a) the number of 
distinguishable alleles according to the Ohta-Kimura model of neutral mutation , and 
(b) the average complexity of Delaunay triangulation of the Euclidian minimum span- 
ning tree [4] . The number of collisions and the absorption time in the Poisson-Dirichlet 
coalescent are interesting, yet particular patterns of recurrence ([!]). Thus after having 
settled the original problem concerning the Poisson-Dirichlet coalescent, our method was 
subsequently extended to cover many other linear recurrences ([T]). 

In this paper, unless stated the contrary, we tacitly suppose that b„ > and, hence, 
«n > 0. However, a perusal of the proofs given below reveals that we could have as- 
sumed that bn is only non-negative or non-positive for large enough n. Under this last 
assumption, formulations of results would get cumbersome which has forced us to keep 
less generality but more transparency. 

Our method can be summarized in the following 

Algorithm 

1) Using, for example, the method described in [28^, obtain the recurrence with 
weights reduced to probabilities. As a result, we obtain the recurrence of the 
form 

Al = 0, An = Bn + y^^PnkAk, 
k=l 

where X]fc=i Pifc = ^ all ^ > 2 and Bn > 0. Let /„ be a random variable with 
distribution P{/„ = k} — Pnk, n > 2,k < n. 

2) Prove the divergence of An using, for example. Proposition 15 . II or other methods. 

3) Find a continuous, strictly increasing and unbounded function g(x) defined on M+, 
such that g(n) = E/„ -I- o(E/„). Pick an xq as defined in ([3]). Find a continuous 
function h{x) defined on M.~^ such that h{n) — Bn- 

4) Find an iterative function g* generated by the quadruple {h, g, xq, k), where k is 
any continuous on [0,2:0] function (see Definition 12. 

5) Using, for example. Theorem 15.21 find, if possible, an elementary function /i such 
that limj;^oo — 1, and set / := fi. Otherwise, select k such that g* is twice 
differentiable, and set / := g* (see Theorem 12. 

6) If /(E/„) — f{g{n)) = o{h{n)) then go to the next step, otherwise go to step 3) 
and choose asymptotically smaller term o(E/„). 

7) If E/(/„) - /(E/„) = o{h{n)) then An - /(n). If E/(/„) - /(E/„) ^ ch{n) then 
An^ {l-cy^f in) (see Theorem [321). 



-'^The result for EXn was conjectured by M. Mohle. 
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We mention, in passing, that iterative functions have aheady been used in the con- 
text of divide-and-conquer paradigm |19j- The cited paper is concerned with stochastic 
processes {T{x),x e R+} whose marginal distributions are given by the equahty 

T{x) = a{x) + T'{t{x)), X e R+, 

where a(-) is a non-negative (deterministic) function and t{-) is a random variable taking 
values in [0, •] which is independent of {T'{x), x G K^}, an independent copy of {r(a;), x e 
M+}. 

The structure of the paper is as follows. Section 2 introduces iterative functions and 
investigates their properties. Section 3 carefully describes the algorithm of our new 
method. Theorem 13.11 and Theorem 13.21 which are the main results of the section prove 
the validity of the algorithm. Section 4 is devoted to applications and also discusses "ins 
and outs" of the method. The paper closes with the Appendix which collects proofs of 
some technical results concerning the iterative functions and properties of recurrences 
®- 

Throughout the paper the notation r(-) ^ s(-) means that r{-)/s{-) 1, as the 
argument tends to infinity, C^™\B) denotes the space of functions which are m-times 
differentiable on the set B. li B = [a, oo) then the derivatives at point a are assumed to 
be the right derivatives. Also we use notation 

r°(°)(x) =^ X, r°«(:E) r(r°('=-i) (x)), k £ N. 

Finally, we recall the standard notation \x\ — sup{fc G Z : fc < x} and \x~\ = mi{k G Z : 
k > x} for the fioor and ceiling function, respectively. 

2. Iterative functions. 

In this section iterative functions are defined and some basic properties of these func- 
tions are given. We start with a formal definition. 

Definition 2.1. Suppose that the function g : K.+ — > M+ is increasing, unbounded and 
continuous, and satisfies the following condition: for some xq > and every xi > xq 
there exists > such that 

(3) X — g{x) > for all x ^ {xo,xi). 

Assume that the functions h : IR+ — > M+ and k : [0,a;o] M are continuous and define 
the function g* : R+ — !• R by the following equality 

mo{x) 

(4) g*{x) = J2 + kig^^^oi^^Hx)), 

1=1 

where 

mo(x) := inf{fc > : g°^'''>ix) < xq}. 
We call g* the iterative function generated by the quadruple {h, g, xo, k) and denote it by 
g* = Iter(ft,,5,a;o, fc). 

Note that technical condition ([3]) is sufficient for mo(x) to be finite for every x G M+. 
This follows from the estimate mQ{x) < [ ^~^° J + 1, which is implied by the inequality 

X - ksx > g°'-''\x), x>xo, keN, 
which in turn can be obtained by induction. 

Remark 2.1. From the definition it follows that g* satisfies the functional equation 

(5) g*{x) = h{x) + g*{g{x)), X > xo, 
with initial condition 

g*{x) = k{x), X < Xq. 

Below are some examples of iterative functions 
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. Example 1. Let h{x) — 1, g{x) ~ ax, a G [0,1), a;o = 1; k{x) = 0. Then g*{x) = 
1 + g*(ax), X > 1, or 

g*{x) = [logi x], X > 1. 

. Example 2. Let h{x) = 1, g{x) — logs, xo — 1, /c(x) = 0. Then g*{x) = 1 + 
(7*(logx), a; > 1, or 

g*{x) = log*x, 

the log-star function which is arguably the best known non-trivial iterative function. It 
is clear that Iter(l, xq, 0) — m^{x). In particular, this equality holds for the log-star 
function. 

. If h{xo) 7^ then the iterative functions Iter(/i, 0) are piecewise continuous. 
We however prefer to work with smooth iterative functions which was the main rea- 
son for introducing functions k in Definition 12.11 It turns out that Iter(/i, xq, 0) 
and IteT{h,g,xo,k) have the same asymptotics, and an appropriate choice of k makes 
Iter(/i, g, xq, k) smooth enough. Below we formalize this statement and also describe how 
the mentioned smoothness can be obtained by the choice of k. 

Introduce the equivalence relation « on the set of iterative functions by the rule 

9i ~ 92 ^=^5i = Iter(/i,5,xo,fci), 33 = Iter(/i, g, xq, ^2)- 

This relation induces partitioning the set of iterative functions into the classes of equiv- 
alence. 

Definition 2.2. The equivalence class 

F := {F = lteT{h,g,XQ, k),k £ C[0,a;o]} 

is called the iterative function generated by the triple {h,g,xo). When it does not lead 
to ambiguity, we call an iterative function generated by the triple {h,g,xo) an arbitrary 
element of this class. 

Since \gi{x) — gJl^)! bounded on K+, for any 51,52 ^ F, all iterative function in 
the same equivalence class are asymptotically equivalent (provided they diverge). 

Definition 2.3. An m-times differentiable modification of iterative function g* is an 
arbitrary iterative function g* such that g* w g* and g* £ C^™-' [xq, +00). 

Our first result which is a direct consequence of Lemma 15.11 and Lemma 15.21 given in 
the Appendix shows that provided g and h are smooth enough one can find a function k 
such that the function Iter(/i, 5, xq, k) is smooth. For a collection of functions /i, . . . , /„ 
let W{fi, . . . , /„) denote its wronskian. 

Theorem 2.1. Assume that g,h £ C'-™-'[a;o, +00) and that 

W(x' - g\x), z = 0, . . . , m + 1) (xq) ^ 0. 
Then there exists a function k of the form 

m+l 

k{x) — aix\ 

i=l 

such that the iterative function generated by the quadruple (h, g,XQ,k) is m-times differ- 
entiable on [xo,+oo). 

Remark 2.2. The vector of coefficients (ai, a2, . . . , am+i) is a solution to the system of 
linear equations (see Lemma l5.2p and can be calculated explicitly. 

An example of a smoothed iterative function is given next. 
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Example 2.1. Recall that the log-star function is iterative function generated by the 
quadruple (1, log 1, 0). A twice difFerentiable modification F of the log-star function 
can be constructed as follows. According to Lemma I5.2[ the corresponding function k 
takes the form k{x) — —-^x^ + f^x^ + ^x. Therefore 



Fix) 



1 + F(loga:;), a; > 1, 



^x^ 

13 



13 



[0,1] 



Below are depicted the graphs of functions log* x and F{x) for a; > 0. 




3. Asymptotic behavior of ([2]). 

While investigating recurrence ([2]), without loss of generality, we can assume that for 
every n > 2 

n-l 

(6) ^ c„fc = 1 and c„fc > 0, fc = 1, . . . , n - 1 

k=l 

(see, for instance, p. 9 in '55'). In what follows, recurrences © with &„ > which satisfy 
([S]) are referred to as recurrences with weights reduced to probabilities. If © holds, denote 
by /„ a random variable with distribution 

P{/„ = fc} = c„fe, fc = l,...,n-l. 

Theorem 3.1. Assume that the sequence {a„,n G N} satisfy recurrence Q with weights 
reduced to probabilities. Let g : M+ — > be a continuous, increasing and unbounded 
function such that 

g{n) = E/„ + o(E/„), n oo, 
and h : — )• be a continuous function such that 



h{n) — 6„, n>2. 



If 



• lim a„ = +CX), 

n— >-oo 

• g*(E/„) - g*{g{n)) = o{h{n)), n ^ oo, 

where g* is an iterative function generated by the triple {h,g,xo), then the following 
implications are true 

(7) Eg*{I„)-g*{EIr,)=o{h{n)), n ^ oo ^ 

a„ ~ 9*{n), n oo. 
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(8) Eg*{In) - g*{EIn) dh{n), n ~¥ oo, for some d < 1 =^ 

In ^ (1 ^ d)^^ g* (n), n — > oo. 

Proof. Set a'„ := a„ — g*{n), n G N. The sequence {a[^,n G N} satisfies the recurrence 

n-l 

(9) a'i = -5*(l), a'„^br,-g*{n)+Eg*{I,,) + J2cnka'k, n>2. 

k=l 

If Eg*(/„)-g*(E/„) = o{h{n)) and g*(E/„)-g*(g(n)) = o{h{n)) then the inhomogeneous 
term of ([9]) is o{h{n)). Therefore, applying part (II) of Theorem 15.11 yields = o(a„) 
which implies a„ g*{n). 

If E5*{/„) - (?*(£/„) - dh{n) for some d £ (0, 1) and g*(E/„) - g*{g{n)) = o(/i(n)) 
then the inhomogeneous term of ([9]) is asymptotically equal to dh{n). Therefore, applying 
part (I) of Theorem 15.11 yields a'^ ~ dttn which implies a„ (1 — d)^^g*{n). 

Finally, if E(7*(/„) — (7*(E/„) ^ dh{n) for some d < we can apply part (II) of Theorem 
15.11 to the sequences {g*{n) — a„} and {a„} to conclude that g*{n) — an —dan- The 
latter is equiyalent to a„ ^ (1 — d)~^g*{n). The proof is complete. □ 

Theorem 3.2. Assume that the sequence {a„,n G N} satisfy recurrence ([2]) with weights 
reduced to probabilities. Let g : — > M+ be a twice differentiable, increasing and 
unbounded function such that 

g{n) = E/„ + o(E/„), n -> oo, 

and h : — ?> 6e a twice differentiable function such that 

h{n) — bn, n > 2. 

If the following conditions hold 
(CI) lim a„ = +O0 

(C2) There exists a continuous function k such that the iterative function F generated 

by the quadruple (h,g,XQ,k) is twice differentiable 
(C3) F(E/„) - F{g{n)) = o{h{n)), n ^ oo 
(C4) There exists M > such that for all n E N 

Var/„ < MEIn 

Var / 

(C5) lim sup^>E/ /2 \F"{x)\ " = 
n^oo - n[n) 

(C6) hm — — 

n^oo h[n)\ai In 

,C7) li»^^ = 0, 

then 

On ~ F{n), n — > oo. 

Proof. Since conditions (CI) and (C3) hold, according to implication ([7]) in Theorem l3.1l 
it is enough to show that 

an := E^^(/„) - F(E/„) = o(/i(n)). 

With K 2^ and A„ {|/„ — E/„| > KVar/„} we haye 

< |E(i^(/„)-F(E/„))Uj + |E(F(/„)-F(E/„))U.| =:/3„+7«- 

An application of Chebyshey's inequality yields 

, , , . 2Var/„ supi<„<„ |i^(a;)| 

/3„<2 sup F x P A„ < » ' 

l<a;<« (KVar/„)^ 

which is o{h{n)) by condition (C6). 
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Using the Taylor expansion around E/„ leads to 



7n = 
< 



E(i^'(E/„)(/„ - EIn) + iF"(e„)(/„ - EInf) Uc 

F'(E/„)E(/„ - E/„)1a„ I + ^\EF"{6n){In - ElnfU-^^ 



71, n + 72,' 



where 6'„ G [E/„ — KVar /„ . E/„ + KVar /„] . Consequently, by Cauchy-Shwartz and Cheby- 
shev's inequalities we obtain 



7i,„ = |F'(E/„)E(/„ -E/„)UJ < |^^'(E/„)|v/E(/„ -E/„)VlP(^n) 



Var /„ 1 



which is o{h{n)) by condition (C7). 

Finally, an appeal to condition (C4) allows us to conclude that 

1 



^4 < ^ sup |F"(a:)|Var/„ 

^ x>EI„/2 

which is o{h{n)) by condition (C5). The proof is complete. 



□ 

Theorem 13. II and Theorem 13.21 iustifv the Algorithm given in the Introduction. 

4. Applications. 

4.1. Exchangeable coalescents. 

4.1.1. Number of collisions in beta{a, 1)- coalescents. Let Xn be the number of collisions 
in beta(a, l)-coalescent, a > 0, restricted to the set {1, . . . ,n}. Many results concerning 
the asymptotics of EX^', fc S N, are known [21 [71 [Tl [H [5S] but we partially derive 
them again just in order to show how our method works. 

It is known (see, for instance, [T71 Section 7]) that the sequence {Xn,n g N} satisfy 
the distributional equality 

Ai=0, A„ = l + A/„, n>2, 
where /„ is a random variable with distribution 

(2-a)r(a+A;-l) 



P{I„ =n~k}= ^ "^' "ffi+t-'i) ' k = l,...,n-l, n>2. 



r(a)r(n+i) 
if a 7^ 2, and 

P{/„=n-fc} = - 1 fc = l,...,n-l, n>2, 

[hn - l){k + 1) 

where /i„ = X]fc=i k~^, if a = 2. 

By Proposition 15 . 1 1 it follows that lim EX^ = +oo. It is also clear that no reduction 

n— >oo 

of weights to probabilities in the recurrence is needed. 
Case < a < 1 [3[T3[17]. Since 

E/„ = 71- (1 - a)-i + o(l), 

we can choose 

gix) ^ X and hix) = 1. 

1 — a 

Then functional equation ([5|) has an elementary solution g*{x) = (1 — a)x. By Theorem 
[Xn EX„ - g*{n) - (1 - a)n. 
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Case a — 1 (Bolthausen-Sznitman coalescent) [T71 HH US] . Since 

E/„ = n-logn + 0(l), 

we can choose 

g{x) = X — \ogx and h{x) = 1. 
From the relation = l+o(l) + l5^^^^fj) and TheoremlST^it follows that Iter(/i, g, 2)(a 
. An application of Theorem 13. 2n gives EX„ ^ . 

Case a = 2 [16]. Since 

E/„ = n - + O 



logn \\og^n, 
we can choose 

g{x) = (^^ ~ io^) ^"^^ ^ ^' 

From the relation ^ log^ a; = l + o(l) + i log^ (^x — j;^^ and Theorem 15.21 we conclude 
that lteT{h, g,2){x) ^ log^ x. Direct calculations show that 

1 



and 



log'E/„-log"g(n) = 



lim ifElog^/„ -log^EJ 



log n 



which, in view of implication ([S]), yields EX„ ^ log^ n. 

4.1.2. Functionals acting on the Poisson-Dirichlet coalescent. Unlike the beta coales- 
cents, the asymptotics of the moments of the number of collisions X„ in the Poisson- 
Dirichlet coalescent does not seem to have been known so far (we refer to [23l [29] for 
extensive information about this particular coalescent with simultaneous multiple colli- 
sions). Recall that this fact has served as an initial motivation for developing the method 
reported in this article. 

It can be checked that the sequence {X„, n e N} satisfies the distributional equality 

Xi^O, Xn^l + Xi,^, n>2, 
where ^ 

P{/„ = fc} = --^-— s(n,fc), fc = l,...,n-l, n>2, 

[e\n - 9" 

s(n, k) is the unsigned Stirling number of the first kind, and [6]n — 6{9+l) . . . {6 + n—l). 
This implies that 

E/„ = 6ilog?i + 0(l), Var/„ = 6ilog7i 0(1). 

Set 

g(a;) := 6'loga;, h{x) = 1 and (7*(a;) = Iter(ft,, (7,a;o) 

for some xq > exp{29 V 1). Notice that the g* is a generalized log-star function, which 
can be defined via functional equation 

g*{x) = 1 + g*{eiogx), X > xq. 

Let F be a twice differentiable modification of the g* of the form 

1 + F(6'loga;), x > xq, 
aix^ + a2X^ + a^x, x G [0, xq]. 



F{x) 



^The only thing which may require verification is condition C3. In the present situation, E/„ —g(n) = 
0(1), and the derivative of F{x) = x/logx tends to zero, as x oo. Therefore, condition C3 follows 
by application of the mean value theorem. 
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for some constants ai, a2, 03. It follows that, for every fixed j G N, 
F'i^) = of- ^-7-T— ) and F"{x) - o 



, X log X ■ ■ ■ log"'-'' {x) J \x'^ (log xy ■ ■ ■ (log"'-"'-' {x)y 

An application of Theorem 13.21 yields KXn ^ g*{n) ^ F{n). Analogously, we obtain 

EX^~(5*(n))^ fceN. 

Other important functionals acting on the Poisson-Dirichlet coalescent are the absorp- 
tion time r„ (in the biological context, r„ is the time back to the most recent common 
ancestor of a sample of size n) and the total branch length L„ of the coalescent tree. 

Corresponding distributional recurrences are 

Ti = 0, r„^T„+T7„, n>2, 

Li = 0, Ln = riTn + Li^^, n>2, 

where t„ is a random variable with the exponential law with parameter f;„ = 1 — jjj— 
which is independent of everything else. 

Using induction on fc, the fact that lim g„ — 1 and Theorem 15.11 we conclude that 

ET,t^(g*(n))^ fc G N. 

An application of Chebyshev's inequality immediately leads to the following weak laws 
of large numbers. 



Theorem 4.1. As n 00, 



Ar„ f T„ p 

— > 1 and — —7' 1. 



g*{n) g*{n) 
As far as L„ is concerned, we can prove that 

ElJ; ~ fc!n^ fc e N. 

By the method of moments this immediately gives the following weak convergence result. 
Proposition 4.1. As n ^ 00, 

Ln d r 

>L, 

n 

where L is a random variable with the standard exponential law. 

In a recent preprint [SS] the same result was obtained by a different method. We thus 
omit further details. 

4.2. Examples from the analysis of algorithms. We will give new proofs of the 
results from [24], [21] and [27], respectively, by using our method. 

4.2.1. The Quickselect algorithm. Let Xn be the number of comparisons that the Quick- 
select algorithm needs to find min(a::i, . . . , x„) of a sample xi, . . . , x„. Then 

ATi = 0, A„ ^ n - 1 + A:/„ , n > 2, 
where /„ = J„ V 1, and J,i is uniformly distributed on {0, . . . , n — 1}. Since 

n-l 1 
2 n 

we can choose 

g[x) — — ^ — and h{x) — x ~ \. 

Then functional equation ([5]) has elementary solutions g*{x) = 2x+c, c G M. By Theorem 
O EXn g*{n) - 2n. 
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4.2.2. The depth of a random node in a random binary search tree. The corresponding 
recurrence is 

Xo = -1, Xi =0, X„ = 1 + X/„, n > 2, 
where P{/„ = fc} = 2k/n^ for A; e {1, . . . , n - 1} and P{/„ = 0} = 1/n. Since 

_ (n-l)(2n-l) 

we can choose 

g{x) = 2x/3 and h{x) — 1. 
According to Example 1, the corresponding iterative function is 

g*{x) = [log3 xl, x>l. 



Since hm (Elog+/„ - logn) = -1/2, it follows that lini (E/(/„) - /(E/„)) = 1 

log^ X 



2iog(3/2) ' where 



log(3/2)- 

Since /(a;) ~ then, according to the Algorithm, EX,i ^ 2 log(3/2)/(n) ^ 21ogn. 

4.2.3. The Quicksort algorithm. Let X„ denote the random number of comparisons 
needed to sort a list of length n by the Quicksort. Then Xq = Xi = 0, and 

X„^n-1 + + j„ , n > 2, 

where {X'^, n G Nq} is an independent copy of {X„, n G Nq} which is independent of /„ 
having the uniform distribution on {1, . . . , n}. Set a„ :— EX„, then oq = ai = and 

a„ = n — 1 + > — flfe, n > 2. 
■■^ n 

k=Q 

The reduction of weights to probabilities can be made by the substitution := a„/(n+l) 
which yields 

n-l 



, n-l v-^ 2(fc + 1) , 
n + 1 nin + 1 



A;=0 



Using the same arguments as in the previous example we obtain ^ 21ogn. Therefore, 
EX„ ~ 2rilogri, which is well known asymptotic for Quicksort. 

4.3. Limitations of the method. 

(a) An indispensable requirement of our method to work is the divergence of a„ , the 
solution to ([2])- In particular, our method cannot detect the convergence of a„ to 
a constant. 

(b) It may be difficult to guess which elementary function has the same asymptotics 
as a given iterative function. 

(c) If condition ([5]) holds for some c ^ 0, it may be hard to calculate the constant c 
explicitly. Therefore, it seems that a natural assumption for the method to work 
is ([7]) rather than (|8]). Condition (O holds if the solution is nearly linear and the 
variance of index /„ grows not too fast (precise statements are made in Theorem 
13. 2|) . For instance, the mean number of collisions in the Bolthausen-Sznitman and 
Poisson-Dirichlet coalescents exhibit the asymptotic behavior of this type. 



First result concerning the complexity of the (non-randomized) Quicksort algorithm with 0(n log n) 
asymptotic goes back to the pioneer work by Hoar |14l . For complete analysis of the Quicksort and its 
different modifications we refer to survey 1301 . 
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5. Appendix. 

5.1. Some properties of iterative functions. For the given strictly increasing contin- 
uous function g, there exists the unique inverse function g"^ which defines the sequence 
{An, n G No} as follows 

(10) Ao = 0, A, :=(.g-i)°(*-i)(a;o), * G N. 

Lemma 5.1. Assume that g,h,k G C^™-* [xq, +cxd) and F = Iter(/i, xq, fc) is m-times 
differentiahle at xq. Then F is m-times differ entiahle on [xo,+oo). 

Proof. We only treat the case m — 1, as, for m — 2,3,..., the proof is the same. 
Since is a sum of compositions of C*-^^ [xq, +00) functions, it is differentiahle on 
[xQ^-\-oo)\{Ai,i G N}. Therefore, we only have to check the continuity and differen- 
tiability at points {Ai, i G N}. 

First step. Proof of continuity. By the assumption, F is continuous at Ai = Xq, i.e., 

(11) k{xa)^h{xa) + k{g{xa)). 
For fixed fc > 2, we have from (|4|) 

fe-i 

(12) F{Au^Q) - ^%°(*-i)(Afe-0)) + fc(5°('=-i)(Afc-0)), 

i=i 

and 

k 

(13) F(Afc+0) = ^/i(.g°('-i)(A,+0)) + fc(.9°W(Afc-|-0)). 

1=1 

Use now (jlip and continuity of h and g to obtain 
F{Ak + 0) - F{Ak - 0) 

= h{g°^^-^\Ak)) + Kg^^'^HAu)) ~ Hg^'^-'HAk)) 

= h{xo) + k{g{xo)) - k{xo)^ 0. 
Second step. Proof of differentiability. The differentiability of F at xq implies that 

(14) k'{xo)^h'{x„) + k'{g{xo))g'{xo). 
For fc > 2, using ([H]) and (HSl) yields 

^ i— 1 

n(A.)^Jnn^^A(^,(,o,_i)(^))^,(^o(.)(,))^. 

Consequently, 

F'^{A^) ~ F'_{A^) 

= hm A/,(gO(fc-i)(^)) + fc(^o(fe)(^))_ 1- rffc(gO(.-i)(^^))^ 

a:-i.Afc+0 ax K-^Afc-O dx 

Set u{x) .g°(''"i)(x), then u{Au + 0) = u(Afe - 0) = u{Au) = xq and 
F[{Ak) ~ F'_{Ak) 

— lim -—h(u(x)) + k(q(u(x))) — lim -—k(u(x)) 

x^A^+O dx x^Ak-O dx ^ ^ " 

lim (h!{u{x))^^k\g{u{x)))g'{u{x)))'u:{x)- lim k' (u(x)))v! [x) 

ic— ^yl^~(~0 — y A.]^ — 

= (/i'(a;o) -f k! {^g{x^))g\x^) - fc'(xo))u'(xo) = 0, 
by (HH) . The proof is complete. □ 
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From this lemma it follows that the function F is m-times differentiable provided it 
satisfies conditions 

k{xo) = h{xo) + k{g{xo)), 
^-[^g^ k'{xo) = h'{xo) + k'{g{xo))g'{xo), 

fcM(xo) = h'^"'Hxo) + {k{gixoW"'\ 
The following lemma proves the existence of such a function k(x). 



=^ 0. Then there 



Lemma 5.2. Assume that w(^x — g{x), . . . ,a;™+^ — 5™+^ (a;) 
exists a function k{x) — X^I^Ti^ ^i^^ which satisfies U5\). 

Proof. Plugging the representation k{x) = '^i^'' '^^^'^ CSl) gives the system of linear 

equations 

(ai{xQ - g{xQ)) + ... + a„,+i(x^+'^ - 5™+i(a;o))) = h{xo), 
lai±{x - g{x)) + ... + a,„+i _ g"^+\x))) = h'{xo), 

\ / X—Xn 



- gix)) + ...+ am+i£^{x"'+' - g"'+Hx))) = h("^\xo). 

V / X—Xq 

The determinant of this system is W{xq) which is not equal to zero by the assumption. 
Therefore, the system has a unique solution which implies that the function k is well 
defined and satisfies conditions dTSl). □ 



5.2. Inhomogeneous terms of recursion ([2|) and iterative functions. 
Theorem 5.1. Suppose that {a„,n G N} and {a'„,n £ N} satisfy the recurrences 

ri-l 

(16) an = &n + Pnkak, n> N, 

fe=i 

and 

ri-l 

(17) a'„ = bl^ + Y,Pnkak,n>N, 

k=l 

respectively. Suppose that bn > for n > N and lim a„ = +00. Then 

I. 6jj ^ bn, 71 — > cx) implies a!^ ~ a„, n — ^ 00, and 

II. 6Jj = o{bn), n ^ 00 implies a[^ = o(a„), n — 00. 

Proof of (I). We exploit the idea of proof of ^ Proposition 3]. Suppose there exists 
eo > such that a„ > {l + eo)a'^ for infinitely many n. Since lim o„ = +00, we can pick 

n— f 00 

e G (0,eo] such that for any c > the inequality a„ > (1 + £)aj^ + c holds for infinitely 
many n. Let Uc be the minimal such n. Since lim Uc — +00, without loss of generality 

we can assume that Uc > N . For n < Uc — I we have a„ < (1 + £)ajj + c which implies 



ric — 1 rtc — l 



2_, Pnaka'k- 



(1 + e)a'^^ + c < a„^ = + ^ Pn.kak < bn, + c + (1 + e) 

fc=i fc=i 

Simplifying the last expression gives 1 + e < bn,/b'n ■ Sending c — cxd leads to e < 0, 
which is a contradiction. Thus we have proved that 

lim sup — < 1 . 

n— >oo 0>n 

A symmetric argument proves the converse inequality for lim inf. 
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Proof of (II) proceeds by applying the already established part (I) to the sequences 
{a„,n e N} and {a^ — a^,n G N} and noting that the relation 6„ ^ &„ — h'^ implies 
«n ~ a„ — aj^. The proof is complete. □ 

Using a similar reasoning one can prove the following. 

Theorem 5.2. Let the triples [hi^g^xo) and {h2,g,XQ) generate the iterative functions 
fi and /2, respectively. Assume that lim fi{x) — +00. Then 

X^OQ 

I. h2{x) ^ hi{x), X — > 00 implies f2{x) ^ fi{x), x — > 00, and 

II. h2{x) — o{hi{x)), X ^ 00 implies f2{x) — o{fi{x)), x 00. 

5.3. Sufficient condition for the divergence of solutions to ([2]). A simple sufficient 
condition for lim a„ — +00 is given next. 

Proposition 5.1. Assume that the sequence {a„,n e N} satisfies If In 00 and 
liminf &„ = 6 > 0, then lim a„ — +00. 

n— )-oo n— foo 

Proof. From recurrence ([2]) we obtain 

n-l M-1 n-1 

an = + ^ Pnkttk = &n + X/ P"-k'^k + ^ PrifcAfe 

k=l k=l k=M 

M-1 n-1 

> bn+ ( inf flfe ) pnk + ( inf ) pnk- 

k=l ~ ~ k=M 

Sending n ^ 00 gives lim inf a„ > b + inf Ok . Letting M ^ 00 leads to lim inf a„ > 

n— ^00 k>Al n— >oo 

& + lim inf a„ which completes the proof. □ 

n— f 00 
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